Which Network Similarity Measure Should You Choose: An Empirical Study
نویسندگان
چکیده
We consider the problem of determining how similar two networks, without known node-correspondences, are. This problem occurs frequently in real-world applications like transfer learning and change detection. Many network similarity measures exist, and it is unclear how one might select from amongst them. We provide the first empirical study on the relationships between different network similarity methods. Here, we propose (1) an approach for identifying groups of comparable network similarity methods and (2) an approach for computing the consensus among a given set of network similarity methods. We apply our approaches to seven real datasets and twenty network similarity methods. Our experiments demonstrate that (a) different network similarity methods are surprisingly well correlated, (b) some complex network similarity methods can be very closely approximated by much simpler methods, and (c) two network similarity methods–namely, random walk with restarts and NetSimile–provide similarity rankings that are closest to the consensus ranking.
منابع مشابه
An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering
Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...
متن کاملLink Prediction using Network Embedding based on Global Similarity
Background: The link prediction issue is one of the most widely used problems in complex network analysis. Link prediction requires knowing the background of previous link connections and combining them with available information. The link prediction local approaches with node structure objectives are fast in case of speed but are not accurate enough. On the other hand, the global link predicti...
متن کاملAn improved similarity measure of generalized trapezoidal fuzzy numbers and its application in multi-attribute group decision making
Generalized trapezoidal fuzzy numbers (GTFNs) have been widely applied in uncertain decision-making problems. The similarity between GTFNs plays an important part in solving such problems, while there are some limitations in existing similarity measure methods. Thus, based on the cosine similarity, a novel similarity measure of GTFNs is developed which is combined with the concepts of geometric...
متن کاملPresenting a Hybrid Approach based on Two-stage Data Envelopment Analysis to Evaluating Organization Productivity
Measuring the performance of a production system has been an important task in management for purposes of control, planning, etc. Lord Kelvin said :“When you can measure what you are speaking about, and express it in numbers, you know something about it; but when you cannot measure it, when you cannot express it in numbers, your knowledge is of a meager and unsatisfactory kind.” Hence, manag...
متن کاملEigenvalue Sensitive Feature Selection
In recent years, some spectral feature selection methods are proposed to choose those features with high power of preserving sample similarity. However, when there exist lots of irrelevant or noisy features in data, the similarity matrix constructed from all the un-weighted features may be not reliable, which then misleads existing spectral feature selection methods to select ’wrong’ features. ...
متن کامل